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Abstract 

We consider a multi-user network where a network manager and selfish users interact. The network 
manager monitors the behavior of users and intervenes in the interaction among users if necessary, while 
users make decisions independently to optimize their individual objectives. In this paper, we develop a 
framework of intervention mechanism design, which is aimed to optimize the objective of the manager, 
or the network performance, taking the incentives of selfish users into account. Our framework is general 
enough to cover a wide range of application scenarios, and it has advantages over existing approaches such 
as Stackelberg strategies and pricing. To design an intervention mechanism and to predict the resulting 
operating point, we formulate a new class of games called intervention games and a new solution concept 
called intervention equilibrium. We provide analytic results about intervention equilibrium and optimal 
intervention mechanisms in the case of a benevolent manager with perfect monitoring. We illustrate 
these results with a random access model. Our illustrative example suggests that intervention requires 
less knowledge about users than pricing. 

Index Terms 

Game theory, incentives, intervention, mechanism design, multi-user networks, network management 

I. Introduction 

A. Motivation and Our Approach 

In noncooperative multi-user networks, multiple users interact with each other making decisions inde- 
pendently to optimize their individual objectives. Operation by selfish users often results in a suboptimal 
network performance, and this calls for an incentive mechanism to guide selfish users to behave according 
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to the system objective. In this paper, we propose a class of incentive mechanisms, called intervention 
mechanisms, that can be used when there is a network manager that can monitor the behavior of users 
and intervene in the interaction among users if necessary. Since the manager can react to the behavior 
of users through an intervention mechanism, an intervention mechanism has the potential to shape the 
incentives of users. 

To design an intervention mechanism and to predict the resulting operating point, we formulate a new 
class of games called intervention games and a new solution concept called intervention equilibrium. In 
an intervention game, the manager first chooses an intervention mechanism, and then users choose their 
actions knowing the intervention mechanism chosen by the manager. After observing a signal about the 
actions of users, the manager chooses its action according to the intervention mechanism. An intervention 
mechanism specifies an action that the manager takes following each possible signal realization, and thus 
can be considered as a protocol that prescribes a rule according to which the manager rewards and 
punishes users depending on their observed behavior. The manager chooses an intervention mechanism 
to optimize its objective, anticipating the rational behavior of users given the intervention mechanism it 
chooses. Intervention equilibrium predicts the outcome of an intervention game in terms of an intervention 
mechanism designed by the manager and an operating point chosen by users. 

B. Related Work 

An approach similar to ours can be found in (2. The authors of ID consider a congestion control 
problem and analyze performance under different scheduling mechanisms, which can be considered as 
intervention mechanisms. With a scheduling mechanism that assigns a higher priority to flows with a 
smaller rate, the input rates of users can be restrained voluntarily. In other words, users do not send 
their traffic excessively in their self-interest because doing so will result in a higher probability that 
their packets are dropped. Another closely related work is our previous work |2]. In O, we consider a 
random access network where the manager can intervene by jamming the packets of users and show that 
intervention mechanisms can successfully control the transmission probabilities of selfish users. However, 
these works consider only specific communication networks. In this paper, we aim to develop a general 
framework of intervention mechanisms that can be applied to various application scenarios including not 
only communication networks but also other scenarios such as file sharing on peer-to-peer networks, 
multi-task processing on cloud computers, and load balancing on multi-core processors. 

In an intervention game, the manager makes a decision before users do, and thus the manager can be 
considered as a leader and users as followers. Such a hierarchical structure of multi-user interaction has 
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been modeled as a Stackelberg game in the context of congestion control 0, medium access control 
H, and power control Q. The main difference between our intervention approach and the traditional 
Stackelberg approach can be found in the ability of the manager. In an intervention game, the manager 
can monitor the actions of users and can commit to an intervention mechanism. Hence, a strategy of 
the manager is its complete contingent plan for actions to be taken given a signal realization. On the 
other hand, in the Stackelberg games in (3l-||5l, the leader does not have monitoring and commitment 
capabilities, and thus simply chooses an action before followers choose their actions. Our formulation 
includes the traditional Stackelberg formulation as a special case, because an intervention mechanism 
reduces to an action if the manager chooses the same action regardless of signal realizations or if it 
observes no signal. Therefore, intervention games yield higher design flexibility for the manager than 
traditional Stackelberg games do, enabling the greater potential to shape the incentives of users. 

Pricing mechanisms that charge users for their usage have received a significant amount of attention 
in the literature O. Pricing mechanisms have a solid theoretical foundation in economics as well as 
high design flexibility. They can be considered as a special class of intervention mechanisms, where the 
intervention of the manager corresponds to charging and collecting payments from users. The existing 
literature has studied pricing mechanisms using Stackelberg games in the context of congestion control 
and cognitive radio networks (H. 

C. Focus and Advantages of Our Approach 

Our discussion so far suggests a classification of intervention into direct and indirect intervention. 
Intervention is direct (resp. indirect) if the manager interacts with users in the inside (resp. outside) of 
the network. In other words, the manager with direct intervention intervenes in the network usage of 
users, while the manager with indirect intervention influences the utilities of users through an outside 
instrument, for example, monetary payments in the case of pricing. Direct intervention can be further 
classified into adaptive and constant intervention, depending on whether the manager can react to the 
actions of users. Traditional Stackelberg strategies where the manager takes an action before users do 
belong to constant intervention. Table U classifies the existing intervention approaches in the literature 
according to the above criteria. 

Although our framework covers all the three classes of intervention in Table H our main focus is 
on adaptive direct intervention, which has not received much attention from researchers compared to 
the other two classes. Adaptive direct intervention is useful compared to constant direct intervention 
especially when the manager does not value its usage of the network. In this scenario, the manager 
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TABLE I 

Classification of the Existing Intervention Approaches in the Literature. 
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Indirect intervention 
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Adaptive 


Constant 
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desires to achieve a certain operating point with the minimum level of intervention. With constant direct 
intervention the manager needs to consume network resources in order to impact the behavior of users, 
whereas with adaptive direct intervention the manager can use intervention only as a threat to punish 
users in case of deviation. Direct intervention has an advantage over indirect intervention in terms of 
implementation. Since direct intervention affects the network usage of users directly, users cannot evade 
intervention as long as they use the network. On the contrary, enforcing the outside instrument of indirect 
intervention can be costly especially when the network is freely accessible. The difference between direct 
and indirect intervention yields an informational advantage of direct intervention over indirect intervention. 
For instance, direct intervention in 12 and O affects the data rates of users. Hence, the effectiveness of 
a direct intervention mechanism is independent of the way users value their data rate. On the contrary, 
designing a pricing mechanism to achieve a certain operating point in general requires the manager to 
know the utility functions of users. This point is illustrated with an example in Section [V] 

D. Comparison with Other Frameworks 

We first compare the framework of intervention mechanism design with that of network utility max- 
imization (NUM) (9]|. Since our framework allows a general form of the objective of the manager, 
the objective can be set as the sum of the utilities of users, as in NUM. The main difference between 
intervention and NUM is that intervention takes into account the incentives of selfish users whereas NUM 
assumes obedient users. The NUM framework aims to design a distributed algorithm following which 
obedient users can reach a system-wide optimal operating point, using prices as congestion signals. On 
the contrary, the intervention framework aims to design an incentive mechanism that induces selfish users 
to achieve an incentive-constrained optimal operating point. Hence, the intervention framework is more 
relevant in a network with selfish users. 

Next, we compare intervention games and repeated games iflOl . which share a common goal of 
sustaining cooperation among selfish individuals. With a repeated game strategy such as tit-for-tat, users 
monitor the actions of other users and choose their actions depending on their past observations. Hence, 
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the burden of monitoring and executing reward and punishment is distributed to users in a repeated game, 
while it is imposed solely on the manager in an intervention game. If there is no reliable manager in 
the network, a mechanism based on a repeated game strategy is a possible alternative to an intervention 
mechanism. However, in order to implement a repeated game strategy, users in the network must interact 
frequently and maintain histories of their observations. Moreover, sustaining an optimal operating point 
with a repeated game strategy often requires users to be sufficiently patient. On the contrary, implementing 
intervention mechanisms effectively requires neither repeated interaction nor patience of users. 

Lastly, we compare intervention mechanism design with standard mechanism design iTTTI . |[T2l . A 
standard mechanism design problem considers a scenario where the manager can control the system 
configuration but has incomplete information about the types of users. Thus, standard mechanism design 
is concerned about incentives regarding types, whereas intervention mechanism design in this paper 
cares about incentives regarding actions. In economics terminology, standard mechanism design focuses 
on adverse selection while our intervention mechanism design focuses on moral hazard ifTTI . We can 
modify or extend our framework to study the capabilities and limitations of intervention mechanisms 
to overcome incentive problems in the presence of only adverse selection or both adverse selection and 
moral hazard. We leave these topics for future research. 

E. Organization of the Paper 

The remainder of this paper is organized as follows. In Section II, we formulate the framework of 
intervention mechanism design, defining intervention games and intervention equilibrium. In Section III, 
we classify the types of intervention and the manager to point out the broad scope of our framework. In 
Section IV, we consider a benevolent manager with perfect monitoring and provide analytic results about 
intervention equilibrium. In Section V, we apply our framework to random access networks in order to 
illustrate the results. We conclude in Section VI. 

II. Framework 

We consider a multi-user network where users interact with each other taking actions independently. 
There is a network manager that can intervene in the interaction of users. There are N users in the 
network, and the set of users is denoted by M = {1, ... ,N}. For convenience, we call the manager 
user 0. To distinguish the users from the manager, we sometimes refer to a user as a regular user. The 
regular users are indexed from user 1 to user N. The set of all the users including the manager is denoted 
by Mq = M U {0}. Each user i € A/q chooses an action from the set of actions available to it, which 
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is denoted by A{. An action profile of the regular users is written as a = (ai , . . . , ajv), while the set of 
action profiles of the regular users is written as A = JT ie/V -74j. An action profile of the regular users 
other than regular user i is written as a_j = (ax, ... , aj_ l5 a i+1 , . . . , ajv) so that a can be expressed as 

a = (04, a_j). 

The actions of the manager and the users jointly determine utilities they receive. The utility function 
of user i £ A/o is denoted by Uj : Ao x A — > M. That is, itj(ao,a) represents the utility that user i 
receives when the manager chooses action ao and the users choose an action profile a. The manager is 
different from the regular users in that the manager is able to monitor the actions of the regular users 
before it takes its action. The monitoring ability of the manager is formally represented by a monitoring 
technology (S,p). S is the set of all possible signals that the manager can obtain, while p is a mapping 
from A to A (S), where A (S) is the set of probability distributions over S. That is, p(a) represents the 
probability distribution of signals when the regular users choose a. 

Since the manager takes its action after observing a signal, a strategy for the manager is its complete 
contingent plan for actions to be taken following all possible signal realizations while a strategy for a 
regular user is simply its choice of an actionQ Thus, a strategy for the manager can be represented by 
a function / : S — > Ao, which we call an intervention mechanism. That is, an intervention mechanism 
specifies an action taken by the manager following each signal it can observe. Let T be the set of all 
intervention mechanisms, i.e., the set of all functions from S to Aq. We assume that the manager is able 
to commit to an intervention mechanism and that the intervention mechanism chosen by the manager is 
known to the regular users when they choose their actions^ 

To sum up, an intervention game is summarized by the data 

G = (A/o, (A)ieAT , (ui)ieM i (S,p)) , 

and the timing of an intervention game can be described as follows. 

1) The manager chooses an intervention mechanism / € T . 

2) The regular users choose their actions a € A independently and simultaneously, knowing the 
intervention mechanism / chosen by the manager. 

3) The manager observes a signal s € S, which is realized following a probability distribution p(a) e 
A(S). 

'For expositional simplicity, we restrict attention to pure strategies, although our formulation extends easily to the case of 
randomized strategies. 

2 The information about the intervention mechanism can be obtained by an agreed protocol specification or by learning. 
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4) The manager chooses its action ao G Aq according to /, i.e., ao = f(s). 

5) The users receive their utilities based on the chosen actions (ao,a) G Ao x A. 
Consider an intervention mechanism / G T, and define a function v\ : A —¥ M. by 

v{(a) = E p{a) [ Ui (f(s),a)}, (1) 

for all i G M, where E p / a \ [•] denotes expectation with respect to the random variable s following the 
distribution p(a). Then v{ (a) represents the expected utility that regular user i receives when the manager 
uses intervention mechanism / and the regular users choose action profile a. An intervention mechanism 
/ induces a simultaneous game played by the regular users, whose normal form representation is given 
by 

We can predict actions chosen by the selfish regular users given an intervention mechanism / by applying 
the solution concept of Nash equilibrium to the induced game Qf. 

Definition 1: An intervention mechanism / G T supports the action profile a* G A of the regular 
users if a* is a Nash equilibrium of the game Qf, i.e., 

v{ (a*,a*_A > v{ (a*, a*_A for all a« G A iy for all i G M. 

Also, / strongly supports a* if a* is a unique Nash equilibrium of Qf. 

If an action profile a* is supported by an intervention mechanism /, the regular users cannot gain by 
a unilateral deviation from a* as long as the manager uses intervention mechanism /. When choosing an 
intervention mechanism, the manager can expect that the regular users will choose an action profile that 
is supported by the intervention mechanism it chooses. However, when there are multiple action profiles 
supported by the intervention mechanism, the manager may be uncertain about the action profile chosen 
by the regular users. This uncertainty disappears when the intervention mechanism strongly supports an 
action profile. In the formulation of our solution concept, we assume that the regular users will always 
select the best Nash equilibrium for the manager if the induced game has multiple Nash equilibria. 

Definition 2: (f*,a*) G T x A is an (strong) equilibrium of intervention game Q, or an (strong) 
intervention equilibrium, if /* (strongly) supports a* and 

E P (a*)[ u o(f*{s),a*)} > E p{a) [u {f{s),a)} for all (f,a) G T x A such that / supports a. 

f* G T is an (strongly) optimal intervention mechanism if there exists an action profile a* G A such that 
{f*, a*) is an (strong) intervention equilibrium. 
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Intervention equilibrium is a solution concept for intervention games, based on a backward induc- 
tion argument, assuming that the manager can commit to an intervention mechanism and predict the 
rational reaction of the regular users to its choice of an intervention mechanism. An intervention equi- 
librium can be considered as a Stackelberg equilibrium (or a subgame perfect equilibrium) applied 
to an intervention game Q, since the induced game 5/ is a subgame of Q. Since we assume that 
the manager can induce the regular users to choose the best Nash equilibrium for it in the case of 
multiple Nash equilibria, the problem of designing an optimal intervention mechanism can be expressed 
as maxjgjr max ag g(y) E p r a \ [uo(f(s), a)], where £ (/) denotes the set of action profiles supported by /. 



In this section, we classify the types of intervention depending on the form of the utility functions 
of the regular users (i.e., the way the manager interacts with the regular users) and the types of the 
manager depending on the form of the utility function the manager (i.e., the objective of the manager). 
The purpose of classification is to suggest that a wide range of application scenarios can be modeled in 
the framework of intervention. 

Definition 3: Let A, C M> K for some K, for all i G Mq. Intervention is additively symmetric if, for 
each i G M, there exist functions gfj : R — > R and h\ : R 2 -> R, for k = 1, . . . , K and for j G Wo \ {i}, 
such that 



there exist functions gf, : R ->■ R and h\ : R 2 ->• R, for k = 1, . . . , K and for j G A/q \ {i}, such that 



Intervention is symmetric if it is either additively or multiplicatively symmetric. 

The manager with symmetric intervention uses the network as the regular users do, in the same 
position as the regular users. The effect of the actions of other uses including the manager on a regular 
user is represented by an additive term in the case of additive symmetric intervention and a multiplicative 
term in the case of multiplicatively symmetric intervention. Network models in which the manager can 
exert additive symmetric intervention include routing Q and frequency-selective Gaussian interference 
channels |5]. Multiplicative symmetric intervention can be used in the random access model of O. 



III. Classification of Intervention and the Manager 




where denotes the k-th element of a» 



Intervention is multiplicatively symmetric if, for each i G J\f, 
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We can interpret the objective of the manager as the system objective, which can vary depending on 
the types of the manager. Below we classify the manager with symmetric intervention depending on the 
form of its utility function. 

Definition 4: Consider symmetric intervention. The manager is benevolent if 

u (a ,a) = ^ WiUi(a ,a) 

for some (w\, ... ,wn) £ The manager is self-interested if there exist functions : M — y R and 
/ig : M 2 ->• K, for k = 1, . . . , if and for j G A/", such that 

u (a 0l a) = J2 h o ( a tJ2 Soj( a j) ] ( 2 ) 
in the case of additively symmetric intervention, and 

u {a ,a) = Y, h o\ a tn 9oj (aj) J (3) 
fe=i \ jeV / 

in the case of multiplicatively symmetric intervention. The manager is total welfare maximizing if 

u (a ,a) = w u (a ,a) + WiUi(ao, a) 

for some (ioo,wi, • • • , iojv) € where uo( a Oi a ) denotes the right-hand side of (O if intervention 

is additively symmetric and that of (0 if intervention is multiplicatively symmetric. 

A benevolent manager maximizes the welfare of the regular users, attaching a welfare weight Wi to 
regular user i. The utility function of a self-interested manager has the same structure as those of the 
regular users. Recall that the manager with symmetric intervention uses the network as the regular users 
do. A benevolent manager does not derive any utility from its usage of the network, whereas the utility 
of a self-interested manager is derived solely from its usage |5]. A total welfare maximizing manager 
derives utility from its usage as well as that of the other users O. 

It is also possible that the manager plays a special role in the network, participating in interaction in a 
different way from the regular users. For example, the manager plays the role of a scheduler in (2 and a 
billing authority in pricing (71, (H. We call this type of intervention asymmetric intervention to contrast it 
with symmetric intervention^ We present two representative examples of asymmetric intervention, which 
arise naturally in games with transferable utility [13]. 



indirect intervention is necessarily asymmetric, while direct intervention can be symmetric or asymmetric. 
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Definition 5: Intervention is asymmetric if it is not symmetric. Let Aq = Mr? . Intervention is additively 
asymmetric if, for each i £ M, there exists function gi : A — > R such that 

Ui(a ,a) = gi(a) - a , 

where a is the i-th element of ao- Let Aq = [0, 1]^. Intervention is multiplicatively asymmetric if, for 
each i G TV, there exists function gi : A —> R + such that 

Ui(a ,a) = (1 - ao)gj(a). 

With additively asymmetric intervention, the manager takes away the amount a l from the benefit gi (a) 
that regular user i receives from its usage of the network. With multiplicatively asymmetric intervention, 
the manager deducts the a l fraction of the benefit that regular user i receives. Hence, the two types 
of asymmetric intervention in Definition [5] can be compared to two different forms of taxation, where 
additively asymmetric intervention corresponds to taxation with lump-sum tax rates and multiplicatively 
asymmetric intervention to taxation with proportional tax rates. 

Definition 6: Consider additively or multiplicatively asymmetric intervention. Let Co : A — > R + be a 
function that represents the operating cost of the network. The manager is benevolent if 

n (a ,a) = ^Uj(a ,a)- 

The manager is self-interested if 

uo(ao,a) = -c (a) 

in the case of additively asymmetric intervention, and 

u (a , a) = a l gi(a) - c (a) 

in the case of multiplicatively asymmetric intervention. The manager is total welfare maximizing if 

u (a , a) = 22 9i{a) ~ c (a). 

In Definition [6l we use welfare weights wi = 1 for all i £ No because of the assumption of transferable 
utility. For ease of interpretation, let us regard the transfer from the regular users to the manager as 
payments. A benevolent manager does not value payments (or burns payments) it receives from the regular 
users, and thus payments create a welfare loss. On the contrary, a self-interested manager values payments 
from the regular users, and it maximizes its profit measured by the total payment minus the operating 
cost. If there is no operating cost, i.e., cq = 0, then the objective of a self-interested manager is revenue 
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maximization. A total welfare maximizing manager maximizes the net gain from operating the network, 
which is the total benefit of the regular users minus the operating cost. We can also consider an individually 
rational benevolent manager that has utility function uo(a , a) = J2iej\f u i( a o, a ) an(I faces the individual 
rationality constraint that requires the total payment it receives to be no less than the operating cost (i.e., 
SieAf a o > co (a) in the case of additively asymmetric intervention and J2ieAf a o9i( a ) — c o( a ) i n the 
case of multiplicatively asymmetric intervention). In case that a regular user has an outside option, the 
individual rationality constraint for the regular user can be taken care of by including in its action space 
an action that corresponds to choosing its outside option. 

IV. Benevolent Manager with Perfect Monitoring 

In this section, we analyze a class of intervention games that satisfy the following maintained assump- 
tions, while leaving the analysis of other classes for future work. 

Assumption 1: (i) (Benevolent manager) The utility function of the manager is given by uo(ao,a) = 
J2i&Af u i( a o,a) for all (a Q ,a) G A X A. 

(ii) (Existence of minimal and maximal intervention actions) There exist the minimal and maximal 
elements of Aq, denoted a and ao, respectively, in the sense that for all i G M, a and ao satisfy 

Ui(a , a) > Ui(ao, a) > iij(ao, a) for all ao G Aq, for all a £ A. 

(iii) (Perfect monitoring) The monitoring technology of the manager is perfect in the sense that the 
manager can observe the actions of the regular users without errors. Formally, a monitoring technology 
(S, p) is perfect if S = A and only signal a can arise in the distribution p(a) for all a G A. 

In Assumption QJi), we set welfare weights as W{ = 1 for all i G Af in order to cover both cases of 
transferable and nontransferable utility, although our results extend easily to general welfare weights. In 
Assumption [TJii), a and ao can be interpreted as the minimal and maximal intervention actions of the 
manager, respectively. For given a € A, each regular user receives the highest (resp. lowest) utility when 
the manager takes the minimal (resp. maximal) intervention action. In other words, the utilities of the 
regular users are aligned with respect to the action of the manager so that the manager can reward or 
punish all the regular users at the same time. Combining Assumption QJi) and (ii), we obtain 

«o(a ,a) > uo(ao,a) > uo(ao,a) for all ao G Aq, for all a G A. (4) 

Thus, for given a G A the benevolent manager prefers to use the minimal intervention action. With 
perfect monitoring, vf defined in (Q]) reduces to v{ (a) = Ui(f(a), a) for all a G A. 
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We first characterize the set of action profiles of the regular users that can be supported by an 
intervention mechanism. Define £ = U/ 6 jr£(/) = {a E A : 3f E T such that / supports a}. 

Proposition 1: a* E £ if and only if Uj(a , a*) > Ui(aQ, a^, a*_A for all a, E .Aj, for all i E A/". 

Proof: Suppose that iij(a , a*) > Uj(ao, Oj, alj) for all <2j E Aj, for all i E A/*. Define an intervention 
mechanism /a, for each a G i, by 



Then f a * supports a*, and thus a* E £. 

Suppose that a* E £. Then there exists an intervention mechanism / such that Ui(f(a*),a*) > 
Ui(f(ai, alj), Oj, a*_j) for all E Aj, for all i E A/". Then we obtain itj(a ,a*) > Ui(f(a*),a*) > 
Ui{f{ai,a*_ i ),ai,a*_ i ) > Uj(ao, Oi, a*_j) for all a» E A^, for all i E M, where the first and the third 



The basic idea underlying Proposition Q] is that for given a* E A, f a * is the most effective intervention 
mechanism to support a*. Thus, in order to find out whether a* is supported by some intervention 
function, it suffices to check whether a* is supported by / a «. We call /„, defined in ([5]), the maximum 
punishment intervention mechanism with target action profile a, because the manager using fa takes 
the maximum intervention action whenever the users do not follow the action profile a. Let T p be the 
set of all maximum punishment intervention mechanisms, i.e., T v = {f a E T : a E ^4}. Also, define 
£ p = (Jf e jr P £(f) = {a E A : 3f E T v such that / supports a}. The second part of the proof of 
Proposition Q] shows that if o* is supported by some intervention mechanism /, then it is also supported 
by the maximum punishment intervention mechanism with target action profile a*, f a *. This observation 
leads us to the following corollary. 

Corollary 1: (i) £ = £P. 
(ii) If (f*,a*) is an intervention equilibrium, then (/ a *,a*) is also an intervention equilibrium. 
Proof: (i) £ D £ p follows from T D T p , while £ C £ p follows from Proposition [TJ 

(ii) Suppose that (/*,o*) is an intervention equilibrium. Then by Definition [2j /* supports a*, and 
li o(/*(a*)) a*) > u o(/(a))fl) for all (/,a)eJxi such that / supports a. Since a* E £, / a * supports 
a* by Proposition Q] Hence, uo(f*(a*), a*) > uo(f a -(a*),a*). On the other hand, since f a *(a*) = a , 
we have uo(f*(a*),a*) < uo(f a *(a*),a*) by ©. Therefore, u (f*(a*),a*) = uo(f a *(a*),a*), and thus 
' i o(/a*( ffi *) ) fl*) > uo(f(a),a) for all (/, a) E T x A such that / supports a. This proves that (/ a »,a*) 
is an intervention equilibrium. ■ 




(5) 



inequalities follow from Assumption Qlii). 
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Corollary Q] shows that there is no loss of generality in two senses when we restrict attention to 
maximum punishment intervention mechanisms. First, the set of action profiles that can be supported by 
an intervention mechanism remains the same when we consider only maximum punishment intervention 
mechanisms. Second, if there exists an optimal intervention mechanism, we can find an optimal inter- 
vention mechanism among maximum punishment intervention mechanisms. The following proposition 
provides a necessary and sufficient condition under which a maximum punishment intervention mechanism 
together with its target action profile constitutes an intervention equilibrium. 

Proposition 2: (f a . ,a*) is an intervention equilibrium if and only if a* G £ and uq(o , a*) > 110(0$, a) 
for all a G £. 

Proof: Suppose that (f a *,a*) is an intervention equilibrium. Then f a * supports a*, and thus a* G £. 
Also, uo(f a *(a*),a*) > uo(f(a),a) for all (/, a) G F x A such that / supports a. Choose any a G £ . 
Then by Proposition Q] f a supports a, and thus uo(ao> a *) = u o(fa* (a*), «*) > u o(fa(a),a) = U( j (a ,a). 

Suppose that a* G £ and uo(a , a*) > uo(a , a) for all a G £. To prove that (f a * ,a*) is an intervention 
equilibrium, we need to show (i) / a . supports a*, and (ii) uo(f a *(a*),a*) > uo(f(a),a) for all (/, a) G 
J 7 x A such that / supports a. Since a* G £, (i) follows from Proposition [TJ To prove (ii), choose any 
{f, a) G T x A such that / supports a. Then uo(f a *(a*),a*) = iio(ao' a *) — n o(oo' a ) — u o{fi a )i a )> 
where the first inequality follows from a G £. ■ 

Proposition [2] implies that if we obtain an action profile a* such that a* G arg max a€ £ uo(oq, a), 
we can use it to construct an intervention equilibrium and thus an optimal intervention mechanism. 
Corollary [jjii) implies that, when we want to find out whether a given action profile can be supported 
by an optimal intervention mechanism, we can consider only the maximum punishment intervention 
mechanism having the action profile as its target action profile. However, when we are given an optimal 
intervention mechanism f a , in the class of maximum punishment intervention mechanisms, it is not certain 
whether its target action profile a* or some other action profile constitutes an intervention equilibrium 
together with f a * . The following proposition provides a sufficient condition under which a given optimal 
intervention mechanism f a . must be paired with its target action profile a* to form an intervention 
equilibrium. 

Proposition 3: Suppose that uo(a , a) > uo(ao, a) for all a G £. If f a * G F p is an optimal intervention 
mechanism, then (/ » , a*) is an intervention equilibrium and there exists no other a 7^ a* such that (/ a » , a) 
is an intervention equilibrium. 

Proof: Suppose that f a * G T v is an optimal intervention mechanism. Then there must exist a' £ A 
such that (fa*, a') is an intervention equilibrium, i.e., (i) / Q » supports a', and (ii) uo(f a *(a'),a') > 
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uo(f(a),a) for all (/, a) G T x A such that / supports a. Suppose that there exists a' ^ a* that satisfies 
(i) and (ii). Since f a * supports a', we have a' € 8 and thus f a > supports a'. Then, by (ii), uo(ao,a') = 
uo{f a *W),a') > uo{fa'(a'),a') = uo(a , a'), which contradicts the assumption that uo(a , a) > uo(ao,a) 
for all a €f £. Therefore, there cannot exist a' ^ a* that satisfies (i) and (ii). Since there must exist a' £ A 
that satisfies (i) and (ii), a* must satisfy (i) and (ii). ■ 

Although maximum punishment intervention mechanisms have a simple structure and are most ef- 
fective in supporting a given action profile, they also have weaknesses. First, even when (/ a *,«*) is an 
intervention equilibrium, there may be other action profiles that are also supported by f a * . For example, in 
the case of multiplicatively asymmetric intervention where we have a = (0, . . . , 0) and Tiq = (1, . . . , 1), 
f a * supports any action profile a' that has at least two different elements from those of a*. At such 
an action profile, the regular users receive zero utility. As long as uo(a ,a*) > 0, a* Pareto dominates 
other action profiles that are supported by /„*, and this can provide a rationale that the regular users 
will select a* among multiple Nash equilibria of the game induced by f a *. However, in principle, the 
regular users may select any Nash equilibrium, and the manager cannot guarantee that the regular users 
will choose the intended target action profile a* when there are other Nash equilibria. Strongly optimal 
intervention mechanisms have robustness in that they yield a unique Nash equilibrium and thus the 
issue of multiple Nash equilibria does not arise. As mentioned in Section II, an implicit assumption 
underlying the concept of intervention equilibrium is that the regular users will always select the best 
Nash equilibrium for the manager in the induced game. If the manager takes a conservative approach, 
it may assume that the regular users will choose the worst Nash equilibrium for it. In this case, an 
optimal intervention mechanism solves max^ g jr min ag £(j) E p t a \ [«o(/(s), a)], which has a similar spirit 
as maximin strategies lfT3l in non-zero-sum games. 

Another weakness of maximum punishment intervention mechanisms is that they may incur a large 
efficiency loss when there are errors in the system. For example, when a regular user chooses an action 
different from the one it intends to take by mistake (i.e., trembling hand) or when the manager receives 
an incorrect signal (i.e., noisy observation), the maximal intervention action is applied even if the regular 
users (intend to) choose the target action profile. The case of noisy observation can be covered by modeling 
the monitoring technology of the manager as imperfect monitoring. In order to overcome this weakness 
in the case of perfect monitoring, we can consider a class of continuous intervention mechanisms (i.e., 
intervention mechanisms represented by a continuous function from A to Aq) if the action spaces are 
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continual To obtain a concrete result, we consider an intervention game where A4 = [0^,0$] C K with 
a, < Hi for all i G A/q and «j(ao, a) is strictly decreasing in ao on [a , ao] for all a G vl, for all i G N. 
We define an intervention mechanism f% c , for each a £ A and c G R , by 



/a,c(a) = [c(a-a)]£, 

for all a G A, where [x]a = min{max{x, a}, /3}. We call fa, c the o^rae intervention mechanism with 
target action profile a and intervention rate profile c. The following proposition constructs an affine 
intervention mechanism to support an interior target action profile in the case of differentiable utility 
functions. 

Proposition 4: Suppose that U{ is twice continuously differentiable for all i G M. Let a* £ Abe an 
action profile such that a* G (aj,aj) for all i G M, and let 

dui(a , a*) / da t 



dui(a ,a*)/da 

for all i G Afu Suppose that 

d 2 u. 



(6) 



da* 

for alii G M such that c* = 0, 

d 2 u t 



(«0' a «' a -i) — f° r a U a « ^ (o.i) a i) (^) 



(a ,aj,alj) < for all aj G (aj,a*), (8) 



da 2 

cP"tU (fill,; \ 

< 



da 2 l d ai da da\ j (oo>04)0 _ 4 ) = ( c j (a4 _ 0? ) +aoia4)a . .) 

for all aj G (a*, minjaj, a* + (ao — a )/c*}), (9) 

T^-(a , a*, alj-) < for all a; G (a* + (a - a )/c*, 5;) (10) 



for alH G M such that c* > 0, and 

: (a , a;, alj) > for all a» G (a^, a* + (a - a )/c*), 



Zl) da 2 + ^daidaa 8a 2 



< 

(ao,ai,Q-i)=(c* (a;— a*)+a ! a ij a -i) 



5 uuiua 

for all a; G (max{a;,a* + (ao — a )/c*}, a*), (11) 

4 Following 1101 . we say that an action space is a continuum if it is a compact and convex subset of the Euclidean space R K 
for some K. 



We define diii(a , a*)/dao as the right partial derivative of 114 with respect to ao at (a . 



a 
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d 2 m 



da? 



(fflO) a i) a -i) — f° r au a « ^ ( a i) a «) 



for alH G A/" such that c* < 00 Then f a *, c * supports a*. 

Proof: See the Appendix. ■ 
Note that du{ (a , a* ) /dao < for all i G A/" since itj is strictly decreasing in ao- Thus, c*, defined 
in ([6]), has the same sign as dui(a , a*) / da>i. With j4o = [oo,oo]> the action of the manager can be 
interpreted as the intervention level, and the regular users receive higher utility as the intervention level 
is smaller. The affine intervention mechanism f a * c * , constructed in Proposition HI has the properties that 
the manager chooses the minimal intervention level a when the regular users choose the target action 
profile a*, i.e., f a * tC *(a*) = Qq, and that the intervention level increases in the rate of \c*\ as regular 
user i deviates to the direction in which its utility increases at (a ,a*). The expression of c* in ([6]) 
has an intuitive explanation. Since c* is proportional to dui(a ,a*)/dai and inversely proportional to 
—dui(a Q , a*) /dao, a regular user faces a higher intervention rate as its incentive to deviate from (a , a*) 
is stronger and as a change in the intervention level has a smaller impact on its utility. The intervention 
level does not react to the action of regular user i when c* = 0, because regular user i chooses a* in its 
self-interest even when the intervention level is fixed at a , provided that other regular users choose a*_^ 
Finally, we note that if (/ a »,a*) is an intervention equilibrium and / a * c supports a* for some c, then 
(/ a « c ,a*) is also an intervention equilibrium, since f a *{a*) = / a *, c (a*) = £o- 

V. Application to Random Access Networks 

In this section, we illustrate the results of Section JV] by introducing a manager in a model of random 
access networks, similar to the model of |[T4l . Time is divided into slots of equal length, and a user 
can transmit its packet or wait in each slot. Due to interference, a packet is successfully transmitted 
only if there is no other packet transmitted in the current slot. If more than one packet is transmitted 
simultaneously, a collision occurs. The manager can transmit its packets as users do, and it interferes 
with all the users. 

We model the random access scenario as an intervention game. We assume that each user, including 
the manager, transmits its packets with a fixed probability over time. The action of user i, ai, is thus its 
transmission probability, and we have Ai = [0, 1] for all i G A/q. The average data rate for user i G M 



6 We define (a, /3) = if a > fi. 
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when the users transmit according to the probabilities (a ,a) G A) x A is given by 

r;(a , a) = 7*0* ] j (1 - aj), 

where ji > is the fixed peak data rate for user i. The benefit that a regular user i obtains from its average 
data rate is represented by a utility function Ui : M + — > R, which is assumed to be strictly increasing. 
Hence, the utility function of regular user i in the intervention game is given by itj(ao, a) = Ui(ri(ao, a)). 
We assume that the manager is benevolent with the utility function uo(«o> 0) = YlieM u i{ a o^ a ) an d that 
its monitoring technology is perfect. 

In the above intervention game, the minimal and maximal intervention actions are given by Oq = 
and oo = 1, respectively. Since n(Q,a) > and rj(l, a) = for all a G A, for all i G A/", we have 
£ = A by Proposition [TJ In other words, any action profile a G A can be supported by an intervention 
mechanism. Because the maximal intervention action yields zero rate to all the users regardless of the 
action profile, the maximum punishment is strong enough to prevent deviations from any target action 
profileO Since A = [0, 1]^ is compact and uq is continuous, a solution to max ae ^ uo(0, a) exists. Then 
by Proposition 121 (/ a »,a*) is an intervention equilibrium if and only if a* maximizes uq(0, a) on A. 
Also, we can apply Proposition [4] to show that any a* G (0, 1)^ is supported by / a *, c * with c* = 1/a* 
for all i G N. Suppose for the moment that the utility of each user is given by its average data rate, 
i.e., Uj(ao, a) = rj(ao, a) for all i G M. Then for each i G W, we obtain c* = 1/a* > by (O, and we 
can verify that the conditions (l8l)-([T0l) are satisfied. Thus, by Proposition 01 we can conclude that f a *,c* 
supports a*. Note that the concept of an intervention mechanism supporting an action profile is based on 
Nash equilibrium, which uses only the ordinal properties of the utility functions. Therefore, / * jC » still 
supports a* even when the utility function of user i is given by Ui(ao, a) = C/j(rj(ao, a)) for any strictly 
increasing function Ui, for all i G M. 

The above argument points out an informational advantage of direct intervention over indirect inter- 
vention. To highlight the informational advantage of direct intervention, suppose that the objective of the 
manager is to implement a target action profile a* G (0, 1)^, determined independently of (U\, ... ,Un), 
while taking the minimal intervention action when the users choose a*. Then the results in the previous 
paragraph imply that the direct intervention mechanisms f a * and / a *, c * with c* = 1/a* for all i G M 
support the action profile a* for any (U\, . . . , Un). Since direct intervention affects utility through its 



7 In fact, a maximum punishment intervention mechanism can prevent not only unilateral deviations from its target action 
profile but also joint deviations. 
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impact on the rates, its effectiveness does not depend on the shapes of the utility functions. Thus, the 
manager with direct intervention does not need to know the utility functions of the users, (Z7i, . . . , Un), 
in order to design an intervention mechanism that supports a target action profile^ This property can be 
considered as the robustness of direct intervention with respect to the utilities of the users. 

To draw a contrast, consider an alternative intervention scenario where the manager intervenes through 
pricing. In such a scenario, the action profile of the regular users determines their average data rates, i.e., 
ri{a) = 7j«j rijeAA{i}(^ ~~ a i) ^ or au * e while a pricing mechanism specifies the payments that the 
regular users make depending on their action profile, i.e., f(a) = (/i(a), . ■ ■ , /at (a)), where fi(a) is the 
payment of user i. Thus, the utility function of user i is given by 

Ui(f(a),a) = Ui(ri(a)) - /j(a). 

As can be seen from the above expression, pricing affects utility by taking away utility units from the 
users, and thus the shapes and scales of the utility functions matter to the manager when designing 
optimal pricing mechanisms. For example, consider a pricing mechanism that charges each user an 
amount proportional to its average data rate, i.e., fi(a) = PiTi{a) for all i S M, where pi is the price of 
unit data rate for user i. Then the pricing mechanism supports an action profile a* when the manager 
sets pi = U-(ri(a*)) for all i € M, assuming that £/j is differentiable and concave for all i G M. This 
example illustrates that, in contrast to direct intervention, the manager needs to know the utility functions 
of the users, (U\, . . . , Un), in order to design a pricing mechanism that supports a target action profile. 

VI. Conclusion 

This paper presents the intervention framework that is aimed to optimize the objective of the manager, 
or the network performance, taking the incentives of selfish users into account. We have highlighted 
the generality and advantages of our framework. In particular, we have pointed out the advantages of 
intervention over pricing in terms of implementation and informational requirement. To facilitate analysis 
in the intervention framework, we have developed a new class of games called intervention games and 
its solution concept called intervention equilibrium. Our analytic results in this paper are limited to 
the special case of a benevolent manager with perfect monitoring. It is our plan for future research to 
investigate intervention mechanisms in the case of imperfect monitoring, analyzing how errors in signals 
affect optimal intervention mechanisms and the network performance. 

8 Of course, if the target action profile depends on (Ui, . . . ,Un), the manager needs to know (Ui, . . . ,Un) to determine it. 
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Appendix 



Proof of Proposition @] 



Proof: To prove that /, 



supports a* is equivalent to show 



a* G arg max Ui(f a *,c*(<H, alj, a*, al J 



(12) 



for all 2 € A/". Note that f a *,c*{ a i, = [ c i(°i ~~ a D + o ]"°. We consider three cases depending on 
the sign of c*. 
Case 1: c* = 0. 

By ©, we have dui(a ,a*)/dai = 0. Also, we have / a * iC * (a$, a* J = a for all aj G -Aj. Thus, the 
objective function in (ITZb reduces to Uj(a , a^, alj. The condition (O implies that tii(orj, Oj, alj is a 
concave function with respect to a-i on ylj. Also, the first-order optimality condition is satisfied at aj = a* 
since dv,i(a , a*) / da-i = 0. Therefore, a* maximizes iij(a , a;, alj on Aj. 

Case 2: c* > 0. 

Since dui(a ,a*)/dao < 0, we have du^a^, a*) / dai > by (O. First, consider a, G [a^a*]. In this 
region, / a *, c * alj = a , and thus the objective function can be written as 1^(00, aj, alj. Since the 
condition © implies that Uj(a , ai,a*_ { ) is concave with respect to aj on [a,, a*], Ui(a , aj, alj is strictly 
increasing in a, on [a i; a*]. 

Second, consider a; G [a*, min{aj, a* + (a -a )/c*}]. In this region, / a *,c* ( a «) a -i) = c*(ai-a*)+a , 
and thus the objective function can be written as Uj(c*(aj — a*) + a , aj, alj). The first derivative of 
Ui(c*(di — a*) + Oq, aj, a^j) with respect to a, is given by 



while the second derivative is given by the left-hand side of (©. The first derivative is zero at a, = a* 
by ©, while the second derivative is non-positive by (O. Hence, the first derivative is non-positive 

on (a*, min{<2j, a* + (00 — Qo)/c*}), and thus tij(c?(aj — a*) + a , a^, a^) * s non-increasing in a, on 
[a*, min{ai, a* + (a - a )/c* }]■ 

Lastly, consider a» G [a* + (Sq — a )/c*,a"j]. m trns region, / a » )C « (a$, alj = ao, and thus the objective 
function can be written as Mj(ao, «i> a *j)- Since the first derivative of Ui(ao, a^, aHJ with respect to a^ 
is non-positive on (a* + (ao — a )/c* , aj) by (ITOb . Uj(ao, a^, alj is non-increasing in a, on [a* + (ao — 

ao)/c*' 5 *]- 

Case 3: c* < 0. 
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In this case, dui(a ,a*)/dai < and 

a for a,i £ [a,, a* + (a - a )/c*], 

c*(aj - a*) + a for otj G [max{a;, a* + (a - a )/c*}, a*], 
a for otj G [a*, a;]. 

Following an analogous argument as in Case 2, we can show that the objective function is non-decreasing 
in a, on [ctj, a*] and strictly decreasing on [a*, a^], implying that etj = a* maximizes the objective function 
on t4j. 

Note that if the inequalities in (0, d9]), and (fTTb are strict, we have = a* as a unique maximizer for 
all ieAf. m 
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